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This article is concerned with a mathematical tool, the Associated Transfer Matrix T, which 
proves useful in the study of a wide class of physical problems involving multilayer heterostructures. 
General properties of linear, second order differential matrix Sturm Liouville operators are discussed 
as a basis for establishing general properties of T, which is also generally related to the Green 
function G. Some identities satisfied by T are derived, which prove useful in practice to monitor 
the numerical quality of computational processes. 
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O . I. INTRODUCTION 

Z: 

■ Problems concerned with multilayer systems arise very often in different fields of physics. One has to integrate 
differential equations across some sequence of different domains, which raises the question of repeated matching at 

t-H . all interfaces involved. In particular, the development of efficient techniques of epitaxial crystal growth originated a 
great deal of work of this nature in solid state physics, notably -but not only- in the field of semiconductor quantum 
, heterostructures 1,2 . 

A great deal of activity has also been concerned with Quasiregular Heterostructures 3 which follow some non periodic 
self-replicative sequence, of which the best known is the Fibonacci sequence 4 . In these cases the number of interfaces 

■ can be very large. 

t^J- \ It is therefore of practical value to have mathematical tools available which can be useful in the study of such 
systems. Among these, the Surface Green Function Matching 1 has been extended to an arbitrarily large number 
of interfaces 5 . Green functions were generally related to a transfer matrix 6 which transfers amplitudes and normal 
derivatives -of which more will be said presently- but it was found later 7,8 that it is physically more appealing to 
I ■ introduce a different kind of transfer matrix, here denoted T and termed the Associated Transfer Matrix which is even 
\ simpler to operate with and is directly related to the physically interesting magnitudes of the problem under study. 
The purpose of this article is to expound on the general relationship between T and the Green function G, to 
discuss some basic formal properties of T and to derive from this some general identities satisfied by T which can 
• . prove useful in practice. We shall deal explicitly with the case of planar geometry, although the analysis can also be 
adapted to other tractable geometries -e.g. spherical or cylindrical. 

Irrespective of the specific physical nature of the states, modes, waves or quasiparticles we may be concerned with, 
the general setup is the following: We start from 3D linear, second order differential system and Fourier transform 
in the 2D (x,y) plane of the interfaces, thus introducing a 2D wavevector k. We then have a system of N coupled, 
/^-dependent ordinary differential equations in the variable z, in the direction perpendicular to the interfaces. We 
stress that in practice N can be sufficiently large that an analytical study is impracticable. For instance, in electronic 
band structure calculations one can find multiband envelope function calculations of electronic band structure with 
N = 8 9 and N = 14 10 and pseudopotential calculations with up to N = 65 11 . Thus, it is interesting to study in some 
detail the formal properties of an object, T 7 , which is a useful tool for numerical computation. 

The general formulation of the problem is setup in Section 2 and basic questions related to the hermiticity of linear, 
second order matrix differential Sturm-Liouville operators are discussed in Section 3. This provides a basis to study 
infinite domains and regularity at infinity -Section 4- after which the general relationship between T and the Green 
function G is established in Section 5. Finally, general properties and identities satisfied by T are derived in Section 
6. 
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II. FORMULATION OF THE PROBLEM 

The N coupled differential equations involve N coupled amplitudes corresponding to the physical model under study 
-e.g. the components of an electronic wavefunction in an iV-band envelope function model or coupled elastic vibration 
amplitudes and electrostatic potential in piezoelectric waves. Loosely speaking, we shall refer to the 'primary field', 
for reasons presently explained. 

The N amplitudes of the primary field will be condensed in an object, F(z), denoted for convenience as an 'TV- 
vector', which need not imply that it transforms literally as a vector. 

We now consider the vast class of physical problems for which the differential system can be cast in compact matrix 
form as 

L(z) ■ F(z) = + Y(z) ■ + W(z) ■ F(z) = , (1) 

az az 

where we have defined another N- vector, namely 

A(z) = B(z) • ^ + P(z) ■ F(z) . (2) 

We denote this linear differential form, derived from F, as the secondary field. For instance, if F describes elastic 
strains, then A describes stresses. More precisely, normal stress components 1 . Furthermore, we are concerned with 
eigenvalue problems. The eigenvalue parameter, henceforth denoted ft, is contained in W. The dependence on k and 
51 is understood throughout. 

The z dependence of some, or all, of the (matrix) coefficients of (1) and (2) is an essential feature of heterostructures, 
even if piecewise homogeneous domains are involved. 

Now, F(z) is obviously continuous everywhere and, furthermore, integration across any of the interfaces of the 
heterostructure proves that A(z) is also continuous, which bears out the convenience of this compact formulation. 
Since both, the primary field and the secondary field are continuous, this suggests the convenience of defining, instead 
of a transfer matrix M 1 ' 6 which transfers F(z) and F '(z), a different one, here denoted as the Associated Transfer 
Matrix (ATM) T, which transfers the two fields that are continuous everywhere. Thus we define the 2N- vector 



F(z) 
A(z) 



(3) 



and then, starting from any initial point z , we define T(z, z ) which transfers * from z to any z by 

*(z)=T(z,z )-*(z ) . (4) 

Unlike the coefficient transfer matrix, often used for systems with piecewise constant coefficients 12 ' 13 , M(z, z ) and 
T(z, zo) do not depend on knowing a basis of 2N linearly independent solutions. Both, M{z, zq) and T(z, zo), pertain 
to initial value problems, while the Green function G(z, z ') depends on the choice of boundary conditions. Thus, for 
a given differential system M and T are unique, while G is not. The relationship between G and M was discussed 
in 1 ' 6 ' 15 . Here we discuss the relationship between G and T, for which we first study the general properties of the 
latter, for which first some observations concerning the differential system are in order. 

III. THE HERMITICITY OF THE DIFFERENTIAL OPERATOR 

The differential matrix L(z) of (1) is the representation of a linear differential operator which we now discuss. We 
recall 14 that the full definition of the operator requires the specification of the manifold S of functions -here termed 
'vectors' F (z)- on which it acts. 

Let p be an operator represented by the operating rule 
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d 

p = -i— 

dz 



(5) 



So far, p is formally Hermitean 14 , [Chapter 3]. Now let L, represented by the operating rule (1), be an operator 
defined to act on a given manifold S and let F denote the generic function of this particular manifold. We write the 
system (1) explicitly as 



L( Z ).F(z) = ± 



B(z). d ^ + P(z).F(z) 



+ 



dFiz^ + 

dz 



(6) 



Thus, the operator L represented by this equation is 



L = -p[B-p + iP-\ -%Y -p+ W(z)- , 



(7) 



where the dots indicate that, as a matrix, this is to act on vectors F. Furthermore, in most physical problems related 
to heterostructures the matrix coefficients satisfy the conditions 



B = B ] , P = -Y" f , W = W ] . 



(8) 



Thus, for the large class of problems we are concerned with we assume (8) to hold. Then it is easily seen that the 
Hermitean adjoint of L is 



(9) 



which so far only ensures that L is formally Hermitean. 

We now, more specifically, define S as the manifold of functions defined in a given interval [a, b] -provisionally 
assumed to be finite- and satisfying given boundary conditions at its extremes. A physical example could be a 
multiple quantum well with infinite barriers -vanishing amplitudes- in the external media. Let us then consider 
another operator L2, acting on another manifold 1S2 of functions F2 defined in the same interval [a, b] but otherwise 
satisfying so far unspecified boundary conditions. 

We define L 2 to be represented by 



L 2 {z) ■ F 2 {z) 



dz 



(10) 



and then at every point z of the interval [a, b], by assumption common to S and S2, we define the residual 1Z{z) by 



K(z) = 



+F\z) ■ P\z) ■ F 2 (z) + F\z) ■ Y\z) ■ F 2 {z)] 1 . 
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We then study the integral 



(F 2 \LF)= I dz F\{z) ■ L{z) ■ F{z) (12) 

which, after partial integration, is 



(F 2 \LF) = (F\L 2 F 2 y + K(b) - K(a) . (13) 

Now, since the matrix coefficients satisfy the conditions (8), the operating rule for L 2 is the same as for L, whence 
follows that the residual is 



ll(z) = Fl(z) ■ A(z) - Al(z) ■ F(z) (14) 

and thus the residual difference appearing in (13) is 

K(b) - 11(a) = F\(b) ■ A(b) - A\(b) ■ F(b) 

-F\(a) ■ A(a) + A\(a) ■ F(a) . (15) 

The mathematical definition of 72 acquires a physical meaning in terms of the primary and secondary fields, the two 
objects transferred by T. 

This is the central point of the analysis. The operators L and L 2 are not yet fully defined, as the boundary 
conditions have not been specified. If we assume only that <S and 52 are such that the combined effect of their 
boundary conditions is such that 72.(6) — 72(a) vanishes, then 



(F 2 \LF) = (F\L 2 F 2 y (16) 

and L12 is the Hermitean adjoint of L. 

If, furthermore, the boundary conditions are the same separately in S and 52, then £2 -the Hermitean adjoint of 
L- is also equal to L and it is then that L is really Hermitean, that is to say, technically, it is totally Hermitean 14 . 
Such is the case, for instance, for electronic states with external infinite barriers, where all amplitudes vanish, or for 
elastic waves with external free surfaces, where all normal stresses vanish. 

It seems reasonable to put forward this type of analysis to justify the basis of standard calculations for physical 
models with N > 1. Furthermore, this is the starting point to analyse a question directly relevant to the physical 
models employed in practice to study heterostructures. 



IV. INFINITE DOMAINS: CAUSALITY AND REGULARITY AT INFINITY 

This situation arises if the external media are semiinfinite or, simply, if one studies a bulk medium. So far it has 
been implicitly assumed that the eigenvalues are real, which is formally correct if the interval [a, 6] is finite. We 
must now analyse the situation when this is infinite, for which a brief reminder of some basic facts is at this stage 
convenient. In order to fix ideas it suffices to consider a simple case with N = 1. 

Firstly, we must define a sign convention for the description of stationary amplitudes F(z,t). We adopt the 
convention 



F(z, t) = F e^**-"*) 



(17) 
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for a wave travelling to the right. Then, when in the time dependent picture t — > oo, in the stationary state time 
independent amplitude, 

F(z) = F e lkz , (18) 

the variable z — > oo. In order for this to be a physically acceptable regular solution, it cannot blow up for z — > oo, which 
requires formally (i) that the real wavevector k, corresponding to the allowed eigenvalue of a stationary propagating 
eigenstate, be defined as 

k — lim(fc + ie) (19) 
and (ii) that we define the limit of F(z) for z — > oo as the regular limit. 

lim lim F e i{k+ie)z = . (20) 

The same analysis holds obviously for waves travelling to the left and z — ► — oo and the same basic facts hold for less 
simple cases, e.g. for any N > 1. 

Let us now return to the standard operator L on which the analysis of the class of problems here considered is 
based. Since this is defined to act on the manifold S of functions which are regular at infinity, the second and fourth 
terms of the residual difference (15) vanish. However in order for (15) to vanish entirely, the condition on the adjoint 
of L is that the boundary conditions of regularity at infinity must hold for the F\, not the F 2 - In fact, in order for 
(15) to vanish the F 2 must be precisely irregular at infinity, as only then the F\ are regular. Therefore the manifold 
52 is not the same as S and we reach the non trivial conclusion that L, the operator normally used in the description 
of infinite -or, in the event- seminfinite regular media is not totally Hermitean. 

Although a full analysis in general terms ( 16 , §7.5) of the conditions at infinity is outside the scope of this article, it is 
in order to recall that while regularity at infinity is the physical requirement in the stationary state -time independent- 
picture, causality is the condition required in standard physical theory, and this in turn requires u to be defined as the 
limit of (uj + if}) for 77 — > 0. Now, for the class of problems here considered id bears a direct relation to the eigenvalue 
variable ft. For instance, for electronic states O is the energy and id is fl/h; for most wave problems id is SI 1 / 2 , etc. 
Then the causal formulation amounts to requiring that ft is to be read everywhere as 

Q, = lim(Q + irj) . (21) 

Even for bound states, for which the eigenvalue is real, a small imaginary part may have to be added in practice, 
for instance, when calculating spectral functions numerically with a computer code. The point to stress is that (21) 
must be kept as such everywhere and this has a direct bearing on the foregoing analysis. 

It is correct to define t\ the Hermitean conjugate of an operator L as has been done, but in the matrices resulting 
in the z-representation, where f2 appears explicitly, it must always be kept + irj, whether r\ stays small but finite or 
it tends to zero as in (21). In other words, while the standard Hermitean conjugate of a matrix of the general form 

m(n) = mi(fi) + im 2 (Vt) (22) 

is 



m f (fi) = m\(n*) - imi(n*) , 



(23) 
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this is not the concept required for an explicit matrix analysis of the problems here considered. The concept of 
transconjugate matrix 

m c (n) = m\(Q) - irn^fO) (24) 

was used in 6 where the relationship between the standard transfer matrix M and the Green function G was established. 
This was further developed in 1 . If one used m^, instead of m c , then one would obtain a negative density of states. 
We now relate G to T, defined in (4), while extending significantly the analysis. 

V. THE RELATIONSHIP BETWEEN G AND T 

We now start from 

L(z)-G(z,z') = I N 6(z-z') , (25) 

where In is the N xN unit matrix and L(z) is the differential matrix of (1) but, instead of A(z) of (2), the differential 
form is now 

A(z, z ') = B{z) ■ dG{ Q z Z ' ] + P(z) ■ G(z, z ') . (26) 

Defining 

A^z) = lim A(z,z'), (27) 
a first integration of (25) yields the identity 

A + (z) - A~(z) = -I N . (28) 

Furthermore, a detailed formal analysis of (25) shows 16 that if we define 

Z(z, z ') = dG{ d y' ] ■ B\z ') + G(z, z ') ■ P\z ') (29) 

and hence 

± Z(z) = lim Z(z,z') , (30) 

then there is another identity, namely 

+ Z(z) - Z{z) = -I N . (31) 
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It is now convenient to display T in the form 



Taa 


Tad 


Tda 


Tdd 



(32) 



where the submatrices T a p (a, (3 = A,D) are N x N. Then, similarly to (3), we define the rectangular 2N x N matrix 



T(z,z') 



G(z,z>) 
A(z,z') 



and thus 



(33) 



T(z,z') =T(z,z )-T(z ,z') , 
that is to say, for fixed z ' T(z, z ') is transferred like *&(z) and, in particular 

G(z,z') = T AA (z, z ) ■ G(zq, z') + T AD (z, z ) ■ A(z , z') . 



(34) 



(35) 



A similar analysis can be carried out with the Green function of the Hermitean conjugate differential system, which 
has the same ATM. This leads to a formula analogous to (35) in which T c (z ', z ), acting from the right, transfers in 
z 1 with z fixed. 

Following this line of argument and recalling that the definition of T does not depend on the choice of zq, we arrive 
-details in 16 - at a general expression for G in terms of T of the form 



G(z,z') 



J2 a /3 T Aa(z, Z ) ■ C<p ■ T° Af3 (z ' , Z ) Z<z' 



Ea/3 T Aa(z, Zq) • C>p • T C Af] (z ' , Z ) Z> z' , 
for any pair of values of (z, z ') irrespective of the position of z . Hence 



(36) 



A(z,z') 



J2 a /3 T Da(z,Z ) ■ C<p ■ T C A/3 (z ' , Z ) Z < Z 1 

Eafl T Da (z, z ) ■ C>p ■ T c Af5 (z ',z ) z> z' . 



(37) 



Not all the eight (matrix) coefficients C^p, C^p are independent. Firstly, whether we proceed from the first or the 
second row of (36), we must obtain 



g(z) = G(z,z) 



Caa 



Caa 



■AA 



(38) 



Then, from (28) and (37): 



^DA ^DA 



-I 



N ■ 



(39) 
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Furthermore, an expression for Z{z, z ' 



) (29) can be obtained similar to (37) and then, from (31) 




(40) 



Now, consider the configuration z < z n < z '. Then 



G(z,z') = G(z,z )-[g(z )]- 1 -G(z ,z') . 



(41) 



From this and the above equalities -together with the similar expressions for Z(z, z ')- we obtain 




AD i 



AD ■ 



(42) 



These relationships hold quite generally and reduce the number of independent coefficients. The rest of the analysis 
depends on the domain under study. 

For the finite internal domains of a heterostructure, it was shown in 15 that one can define any arbitrary extended 
pscudomedium with a Green function satisfying arbitrary boundary conditions -except for infinite barriers- at the 
extremes of the said domains. In practice a considerable simplification of the algebra can be achieved in this way. The 
external domains are different. Sometimes they are terminated by infinite barriers, which is part of the model. The 
algebra is then fairly simple. A different situation arises if the external domains are assumed to extend to infinity. 
We now consider an infinite regular medium, which is in any case always a valid description for any domain, even if 
it is finite or semiinfinite. 

We take z = and abbreviate T a p(z 7 zo)/T a p(z ', z ) as T a p(z)/T a p(z '). Consider the first row of (36) and take 
z ' = 0. Then 



Regularity at — oo requires the vanishing of the regular limit of (43) . To this effect we note that even if the coefficients 
of the differential system are variable, for sufficient large \z\ these can be asymptotically 'flattened' -i.e. replaced by 
their average values- at sufficiently large distances without essential loss of accuracy. Then T can be written as T(e, z) 
and regular limits taken in the manner of (19)-(20), so we define the regular limits 



Now, the first factor of (43) has asymptotic oscillatory behaviour. Thus the regular vanishing of G(— oo, 0) requires 
the vanishing of the second factor in the regular limit. Following this line of argument and using the general identities 
just presented, we obtain the full set of parameters for the regular Green function 




(43) 




(44) 



C< 



DD — 




— i 



(45) 
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The regular Green function is thus fully determined from T. The latter is always an ultimate resort if the analytical 
solution is not viable, while G is a natural object to extract physical information directly from it. As indicated 
above, the set of parameters C^g, C^g can be considerably simplified by the convenient choice of simpler boundary 
condition. 

VI. THE SYMPLECTIC CHARACTER AND GENERAL PROPERTIES OF T 

Firstly we note that, due to the fact that the two fields transferred by T are continuous everywhere, on crossing 
interfaces separating different media T is obtained by simple chain multiplication of the corresponding different 
matrices. This is a practical useful feature for the description of heterostructures, not shared by iW 15 . Furthermore, 
let us define the 2N x 2N auxiliary matrix 



J= = 



On —In 
In On 



(46) 



Then 



* c • J • * = -F c -A + A c -F 



(47) 



Now, the general definition of the flux density is 



j(z) = i [F\z) ■ A(z) - A c (z) ■ F(z)} = -i<S> c {z) ■ J ■ *(z) . 



(48) 



On the other hand, the second order differential system for F is transformed in the standard way into the first 
order differential system for 



dz 



= D{z) ■ V(z) 



(49) 



with 



D(z) 



-B- 1 (z)-P(z) B-\z) 
\Y(z) ■ B-\z) ■ P(z) - W(z)] -Y(z) ■ B-\z) 



(50) 



It is easily seen that 



d 

dz 



[F c ■ A - A c ■ F] = -* c [D c J + J D] * . 



(51) 



Then, since 



D c J + J D = 2N 



(52) 
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it follows that j(z) is constant everywhere, which is a general expression of the continuity equation. 

Now consider any arbitrary pair of points (zq, z), transfer from z to z, write T(z, zq) for brevity as T and equate 
j(z) to j(z ). This yields 

J = T C J T (53) 
Thus T is a symplectic matrix in a general sense 17 . Since J has unit determinant, it follows that 

|Det[T]| 2 = l (54) 

This provides a concise practical rule which can be used to monitor the quality of the numerical computations as 
z grows away from z . Furthermore, by equating the submatrices of (53) we obtain the three independent identities 



rpc 

1 AA 


• T DA 


rpc 

1 DA 


■ T AA 


= On 


rpc 

1 DD ' 


Tad 


rr~iC 

AD 


Tdd 


= N 


rpc 

1 AA ' 


Tdd 


rpc 

~ 1 DA 


• Tad 


= In 



(55) 
(56) 
(57) 



These can also prove useful in two ways, namely: (i) they can help simplify the algebra in the analytical formulation 
of a problem and (ii) other concise rules for numerical monitoring can also be extracted from them. 

Further details and different ways to combine T and G for the study of multilayer heterostructures are given in 16 . 



VII. CONCLUSION 



The Associated Transfer Matrix T appears to be both, formally appealing and practically useful. It suggests itself 
in a natural way from the physics of the problem. It is unique and does not depend on having an analytical basis, 
so it can always be an ultimate resort, obtainable by numerical integration. It satisfies some general identities, not 
satisfied by other transfer matrices, which provides useful rules to monitor the numerical quality of computational 
processes. Finally, it is directly related to the Green function, from which direct physical information follows in a 
natural way. 
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